Math Indexer and Searcher under the Hood: History and Development of a Winning Strategy

نویسندگان

  • Michal Ruzicka
  • Petr Sojka
  • Martin Líska
چکیده

This paper describes and summarizes experience of Masaryk University Math Information Retrieval team (MIRMU) with the mathematical search developed and performed for the NTCIR-11 Math-2 Task. Our approach is the similarity search based on canonicalized MathML and second generation of scalable full text search engine Math Indexer and Searcher (MIaS) with attested state-of-the-art information retrieval techniques like query expansion. The capability of MIaS system in terms of math query notation, normalization and combining math with textual query tokens was deployed by submitting multiple runs with four query notations provided, and with results merged from multiple queries. The analysis of the evaluation results shows that the system performs best using TEX queries that are translated and canonicalized to Content MathML, where MIaS ranked as #1 for all metrics returning very relevant results.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Math Indexer and Searcher under the Hood: Fine-tuning Query Expansion and Unification Strategies

This paper summarizes the experience of Math Information Retrieval team of Masaryk University (MIRMU) with the NTCIR-12 MathIR arXiv Main Task and its subtasks. We based our approach on the MIaS system. Based on NTCIR-11 Math-2 Task relevance judgements, we developed an evaluation platform. Using this platform we rigorously evaluated combinations of new features and picked the most promising on...

متن کامل

Indexing and Searching Mathematics in Digital Libraries

This paper surveys approaches and systems for searching mathematical formulae in mathematical corpora and on the web. The design and architecture of our MIaS (Math Indexer and Searcher) system is presented, and our design decisions are discussed in detail. An approach based on Presentation MathML using a similarity of math subformulae is suggested and verified by implementing it as a math-aware...

متن کامل

Similarity Search for Mathematics: Masaryk University Team at the NTCIR-10 Math Task

This paper describes and summarizes experiences of Masaryk University team MIRMU with the mathematical search performed for the NTCIR pilot Math Task. Our approach is the similarity search based on enhanced full text search utilizing attested state-of-the-art techniques and implementations. The variability of used Math Indexer and Searcher (MIaS) system in terms of the math query notation was t...

متن کامل

Identifying the pattern of the talent management as the winning strategy of the organization; A study in the National Iranian South Oil Company

This study was conducted to identify the dimensions, components and indices of the talent management in the National Iranian South Oil Company. The study has been considered an applied research in terms of its purpose and in terms of data was qualitative and it has been done based on grounded theory in terms of the nature of the implementation. The statistical population of the study was expe...

متن کامل

A GA Model Development for Decision Making Under Reverse Logistics

  Managing products’ end-of-life and recovery of used products is gaining significant importance during last years. Therefore, managing the reverse flow of products can be an important potential for winning consumers in future competitive markets. In this context, establishing reverse logistics networks is becoming a main problem in reverse supply chains. Genetic Algorithm (GA) is utilized to s...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014